Univariate tests

They are used when a single null hypothesis is to be tested. The following tests are available.

Univariate test functions

Test	Function	Alias
Pearson correlation	`correlationTest`	`rTest`
Trend correlation	`trendTest`
Point bi-serial correlation	`pointBiSerialTest`
Student's t for independent samples	`studentTestIS`	`tTestIS`
1-way ANOVA for independent samples	`anovaTestIS`	`fTestIS`
Chi-squared	`chiSquaredTest`	`Χ²Test`
Fisher exact	`fisherExactTest`
Student's t for repeated measures	`studentTestRM`	`tTestRM`
1-way ANOVA for repeated measures	`anovaTestRM`	`fTestRM`
Cochran Q	`cochranqTest`	`qTest`
McNemar	`mcNemarTest`
One-sample Student's t	`studentTest1S`	`tTest1S`
Sign	`signTest`

You may also find useful the tests we have created as examples of how to create new tests:

Test
Autocorrelation
Chatterjee correlation
Distance correlation

For creating other tests, see Create your own test.

For multiple comparisons tests, see Multiple comparisons tests.

Common kwargs for univariate tests

The following optional keyword arguments are common to all univariate test functions:

direction: an instance of TestDirection, either Right(), Left() or Both(). The default is Both().
equivalent: a boolean. If true (default), the fastest equivalent statistic will be used. See Statistic.
nperm: an integer providing the number of random permutations to be used for an approximate test. It defaults to 20000.
switch2rand: an integer setting the upper limit of permutations to be listed exhaustively. It defaults to 1e8. If the number of possible permutations exceeds switch2rand, the approximate test with nperm random permutations will be performed, otherwise an exact test with all possible permutations will be performed. In order to force an approximate test, set switch2rand to a small integer such as 1. In order to know in advance the number of possible permutations, see nrPerms.
seed: an integer. It applies only to approximate tests. Set to 0 to use a random seed for generating random permutations. Any natural number results instead in a reproducible test. It defaults to 1234.
verbose: a boolean. Print some information in the REPL while running the test. Set to false if you run benchmarks. The default is true.

Correlation test

PermutationTests.correlationTest — Function

function correlationTest(x::UniData, y::UniData;
            direction::TestDir = Both(),
            equivalent::Bool = true,
            switch2rand::Int = Int(1e8), 
            nperm::Int = 20_000, 
            seed::Int = 1234, 
            standardized::Bool = false, 
            centered::Bool = false,
            verbose::Bool = true) where TestDir <: TestDirection

Univariate Pearson product-moment correlation test by data permutation. The null hypothesis has form

$H_0: r_{(x,y)}=0$,

where $r_{(x,y)}$ is the correlation between the two input data vectors, x and y, typically real, both holding $N$ observations.

For optional keyword arguments, direction, equivalent, switch2rand, nperm, seed and verbose, see here.

If standardized is true, both x and y are assumed standardized (zero mean and unit standard deviation). Provided that the input data is standardized, the test provides the same p-value, however it can be executed faster as in this case the cross-product is equivalent to the Pearon r statistic (see Statistic).

If centered is true, both x and y are assumed centered (zero mean). The test provides the same p-value, however it can be executed faster if the test is bi-directional as in this case the equivalent statistic, the covariance, reduces to the cross-product divided by N.

If neither standardized nor centered is true, the data will be standardized to execute a faster test using the cross-product as test-statistic.

Directional tests

For a right-directional test, the correlation is expected to be positive. A negative correlation will result in a p-value higehr then 0.5.
For a left-directional test, the correlation is expected to be negative. A positive correlation will result in a p-value higehr then 0.5.

Permutation scheme: under the null hypothesis, the position of the observations in the data input vectors bears no meaning. The exchangeability scheme consists then in shuffling the observations of vector x or vector y. PermutationTests.jl shuffles the observations in x.

Number of permutations for exact tests: there are $N!$ possible ways of reordering the $N$ observations in x.

Aliases: rTest!, trendTest!

Multiple comparisons version: correlationMcTest!

Return a UniTest structure.

Examples

using PermutationTests
N=10 # number of observations
x, y = randn(N), randn(N) # some random Gaussian data for example
t = rTest(x, y) # by deafult the test is bi-directional

tR = rTest(x, y; direction=Right()) # right-directional test
tL = rTest(x, y; direction=Left()) # Left-directional test
# Force an approximate test with 5000 random permutations
tapprox = rTest(x, y; switch2rand=1, nperm=5000)

Similar tests

Typically, the input data is real, but can also be of type integer or boolean. If either x or y is a vector of booleans or a vector of dicothomous data (only 0 and 1), this function will actually perform a permutation-based version of the point bi-serial correlation test. However, as shown in the preceeding link, the point bi-serial correlation test is equivalent to the t-test for independent sample, thus it can be tested using the t-test for independent samples, which will need many less permutations as compared to a correlation test for an exact test (see examples below). A dedicated function in available with name pointBiSerialTest, which is an alias for studentTestIS and allowa the choice to run the test using a correlation- or t-test statistic.

If x or y represent a trend, for example a linear trend given by [1, 2,...N], we otain the permutation-based trend correlation test, which can be used to test the fit of any type of regression of y on x - see trendTest.

if y is a shifted version of x with a lag $l$, this function will test the significance of the *autocorrelation at lag $l$, see the page Create your own test.

Examples

# Point bi-serial correlation test
using PermutationTests
N=10 # number of observations
x=[0, 0, 0, 0, 1, 1, 1, 1, 1, 1]
y = rand(N)
t = rTest(x, y) 

# Exactly the same test can be obtained as a t-test for independent sample,
# but much faster as for an exact test the latter needs only 210 permutations 
# while the former needs 3628800 permutations.
# This is available with a dedicated function
t2=pointBiSerialTest(y, [4, 6])
println(t.p ≈ t2.p ? "OK" : "error")